RELIEF Algorithm and Similarity Learning for k-NN
نویسندگان
چکیده
In this paper, we study the links between RELIEF, a well-known feature re-weighting algorithm and SiLA, a similarity learning algorithm. On one hand, SiLA is interested in directly reducing the leave-one-out error or 0− 1 loss by reducing the number of mistakes on unseen examples. On the other hand, it has been shown that RELIEF could be seen as a distance learning algorithm in which a linear utility function with maximum margin was optimized. We first propose here a version of this algorithm for similarity learning, called RBS (for RELIEFBased Similarity learning). As RELIEF, and unlike SiLA, RBS does not try to optimize the leave-one-out error or 0 − 1 loss, and does not perform very well in practice, as we illustrate on several UCI collections. We thus introduce a stricter version of RBS, called sRBS, aiming at relying on a cost function closer to the 0 − 1 loss. Moreover, we also developed Positive, semidefinite (PSD) versions of RBS and sRBS algorithms, where the learned similarity matrix is projected onto the set of PSD matrices. Experiments conducted on several datasets illustrate the different behaviors of these algorithms for learning similarities for kNN classification. The results indicate in particular that the 0 − 1 loss is a more appropriate cost function than the one implicitly used by RELIEF. Furthermore, the projection onto the set of PSD matrices improves the results for RELIEF algorithm only.
منابع مشابه
Analogy-Based Reasoning in Classifier Construction
Analogy-based reasoning methods in machine learning make it possible to reason about properties of objects on the basis of similarities between objects. A specific similarity based method is the k nearest neighbors (k-nn) classification algorithm. In the k-nn algorithm, a decision about a new object x is inferred on the basis of a fixed number k of the objects most similar to x in a given set o...
متن کاملارائه مدلی غیرپارامتریک با استفاده از تکنیک k- نزدیکترین همسایه در برآورد جرم مخصوص ظاهری خاک
Soil bulk density measurements are often required as an input parameter for models that predict soil processes. Nonparametric approaches are being used in various fields to estimate continuous variables. One type of the nonparametric lazy learning algorithms, a k-nearest neighbor (k-NN) algorithm was introduced and tested to estimate soil bulk density from other soil properties, including soil ...
متن کاملOptimized Seizure Detection Algorithm: A Fast Approach for Onset of Epileptic in EEG Signals Using GT Discriminant Analysis and K-NN Classifier
Background: Epilepsy is a severe disorder of the central nervous system that predisposes the person to recurrent seizures. Fifty million people worldwide suffer from epilepsy; after Alzheimer’s and stroke, it is the third widespread nervous disorder.Objective: In this paper, an algorithm to detect the onset of epileptic seizures based on the analysis of brain electrical signals (EEG) has b...
متن کاملLearning Vector Quantization With Alternative Distance Criteria
An adaptive algorithm for training of a Nearest Neighbour (NN) classifier is developed in this paper. This learning rule has got some similarity to the well-known LVQ method, but using the nearest centroid neighbourhood concept to estimate optimal locations of the codebook vectors. The aim of this approach is to improve the performance of the standard LVQ algorithms when using a very small code...
متن کاملComparison of Distance Metrics for Phoneme Classification based on Deep Neural Network Features and Weighted k-NN Classifier
K-nearest neighbor (k-NN) classification is a powerful and simple method for classification. k-NN classifiers approximate a Bayesian classifier for a large number of data samples. The accuracy of k-NN classifier relies on the distance metric used for calculating nearest neighbor and features used for instances in training and testing data. In this paper we use deep neural networks (DNNs) as a f...
متن کامل